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MocLified cytochrome P450 monooxygenases 

The present invention relates to modified cytochrome P450 
5 monooxygenases with an altered substrate profile, to nucleic acid 
sequences coding therefor, to expression constructs and vectors, 
to recombinant microorganisms which comprise these vectors, and 
to procel^ses for the microbiological production of terminally or 
subterminally hydroxylated aliphatic carboxylic acids* 

10 

The monooxygenase with the name P450 BM-3 is a cytochrome P450 
enzyme from Bacillus jnegaterium and has a pronounced sequence 
homology with mammalian F450 enzymes (!)• Owing to these 
correspondences, P450 BM-3 constitutes an excellent model system 

15 for this class of P450 enzymes. P450 BM-3 hydroxy lates mainly 

long-chain saturated fatty acids on their co-l, aj-2 and cd-3 carbon 
atom. Amides or alcohol analogs and epoxides of long-chain 
unsaturated fatty acids are also converted (1-3). The catalytic 
activity for saturated fatty acids depends on the chain length, 

20 the chain length optimum . being 14 to 16 carbon atoms. The enzyme 
shows no catalytic activity for fatty acids with a chain length 
of less than 12 carbon atoms ( 1 ) • 

SummarY invention 

25 

It is an object of the present invention to provide cytochrome 
P450 monooxygenase mutants which show a modified substrate 
profile in comparison with the wild-type enzyme. In particular, 
it was an object to provide novel mutants which hydroxy late 

30 saturated aliphatic carboxylic acids in another chain position 
and/or have a modified substrate specificity. In peorticular, it 
was an object to provide mutants which have a catalytic activity 
for aliphatic carboxylic acids of medium chain length, in 
particular with a chain length of 8 to 12, such as, for example, 

35 8 to 10, carbon atoms, and which hydroxy late these carboxylic 

acids subterminally, especially at the positions co-1, o>-2 and/or 
cj>-3. 

We have found that this object is achieved by providing modified 
40 cytochrome P450 monooxygenases which, owing to a combination of 
directed evolution and site-specific mutagenesis of their 
substrate-binding region, show an altered reactivity pattern or 
substrate profile in the terminal and/or subterminal enzymatic 
hydroxylation of aliphatic carboxylic acids in comparison with 
45 the wild type. 
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Conipared to the wild-type enzyme, an "altered substrate profile- 
IS observed in mutants according to the invention. An "altered 
5 substrate profile" means, for the purposes of the present 
invention, a) an improved reactivity such as, for exaniple, an 
increase in specific activity (expressed as nmol reacted 
carboxylic acid/minute/nmol P450-en2yine) and/or at least one 
kinetic parameter selected from amongst Kcat, Km and Kcat/Km, for 
XO example an improvement of at least 1%, such as, for example, 10 
to 1000%, 10 to 500», or 10 to 100%, of the mu;ant forTi;ast 

d^Jl^^^'^'^'f ''^^ aliphatic carboxylic acid or a hydroxylatable 
derivative of an aliphatic carboxylic acid, and/or b) an altered 
15 the'^c^^f increased, regioselectivity in the hydroxylation of 
^^^/^^^y^^<'.«*=^<»- -^here is thus, for example, a shift of the 
preferred terminal or subterminal ««-l, u)-2, a>-3, «>-4 , in 
particular co-1 to co-3) hydroxylation position at at least one 
hydroxylatable carboxylic acid or a hydroxylatable derivative of 
20 ^" acid. Hydroxylatable aliphatic carboxylic 

20 acids or derivatives thereof in which an "altered substrate 
profile- can be observed in accordance with the invention are 
branched or, preferably, straight-chain carboxylic acids having 8 . 
to 30 caxbon atoms. The alteration according to the invention of 
the substrate profile may be manifested over the entire length 
<^-«- C8-C30) or only in sections, for example in the case of 
7«H ""i C"-C,2-, C12-C30-, C12-C25- or c,2-C2o carboxylic acids or 
individual carboxylic acids from amongst these sections. 

Nonlimiting examples which may be mentioned of carboxylic acids 
30 which can be hydroxylated in accordance with the invention are- 
caprylic acid, pelargonic acid, capric acid, undecanoic acid, 
lauric acid tridecanoic acid, myristic acid, pentadecanoic acid, 
palmitic acid, margaric acid, stearic acid, nonadecanoic acid, 

35 m«?i?^^" ^""t"' acid, cerotic acid and 

Tt l ° . ^^^""P^"^ °^ ^uizable carboxylic acid derivatives 

are Ci-C4-alkyl esters, amides or anhydrides with, preferably, 
short-Cham C1-C4 -carboxylic acids. 

The monooxygenases according to the invention are preferably 
40 derived from cytochrome P450 monooxygenases from the enzyme class 
E.G. 1-14.- -, in particular from the P450 family CYP102, and are 
Of eukaryotic or prokaryotic, in particular bacterial, origin. 

45 ^J^!;*^"^^"^ preferred group of mutants is derived from Baoillus 
45 inegaterium cytochrome P450 monooxygenase bm-3 with an amino acid 
sequence as shown in SEQ ID NO; 2, which has at least one 
functional mutation in one of the following amino acid sequence 
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regions: 24-28, 45-51, 70-72, 73-82 (helix 5), 86-88 (helix 6), 
172-224 (F/G loop) and 352-356 (p-strand 8), with the proviso 
that, if the enzyme carries the mutation F87A, more than one of 
these regions is mutated; and functional equivalents of these 
5 mutants. 

In the present description, the mutation in question is shown in 
the amino acid one-letter code. The original amino acid is shown 
before the number which indicates the sequence position of the 
10 mutation, while the modified amino acid is shown after ^e 
number* 

A ''functional mutation" for the purposes of the present invention 
encompasses an aimino acid exchange in the abovementioned sequence 
15 regions which results in an "altered reactivity pattern" or 
''altered substrate profile", in accordance with the above 
definition. 

Especially preferred in accordance with the invention are P450 
20 BM-3 monooxygenase mutants (group (A)) which comprise 
individually or in combination in at least each case one 
functional mutation in the amino acid sequence regions 86-88, 
172-224, 73-82, 45-51 and 24-28 (in accordance with SEQ ID NOs 

2)- 

25 

Thus, for example, Phe87 may be replaced by an amino acid with an 
aliphatic side chain such as, for example, Ala, Val, Leu, in 
particular Val or Ala; Iieul88 may be replaced by an amino acid 
with an amine or amide side chain such as, for example, Asn, Gin, 

30 Arg or, in particular, Lys, or amino acids such as Ala, Gly, Ser 
and Trp; Ala74 may be replaced by another amino acid with an 
aliphatic side chain, such as, for example, Val and, in 
particular, Gly; Arg47 may be replaced by an amino acid with a 
cyclic side group such as, for exan^le. His, Tyr or, in 

35 particular, Phe; and Val26 can be replaced by an amino acid with 
a hydroxy 1 side group such as, for example, Ser or, in 
particular, Thr. 

Preferred group (A) mutants exhibit at least one of the following 
40 amino acid substitution patterns: 

F87V; 

F87A, L188K; 
F87V, L188K; 
F87A, I-188K; A74G; 
F87V, L188K, A74G; 
F87A, I«188K, A74G, R47F; 



a) 
b) 
c) 
45 d) 

e) 

f) 
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g) F87V, I1I88K, A74G, R47F; 

h) F87A, Iil88K^ A74G, R47F, V26T; or 

i) F87V, L188K, A74G, R47F, V26T; 

5 and functional equivalents thereof. 

A further group, group (B), of suitable mutants exhibits a single 
amino acid substitution in one of the abovementioned sequence 
regions or in one of the sequence regions 70-72 and 352-356. In 
10 the two last-mentioned sequence regions^ Ser72, for example, may 
be replaced by an amino acid with an aliphatic side chain such 
as, for example, Ala, Val, Leu, He and, in particular, Gly, and 
Met354 may be replaced by an amino acid with a hydroxyl side 
group such as, for example, Ser or, in particular, Thr. 



15 



In particular, mutants of group B exhibit an amino acid 
substitution from amongst the following: 

a) V26T, 

20 b) R47F, 

c) S72G, 

d) A746r 

e) F87V, 

^) L188Z, where z is K, R, w, Q, n, G, A or S, 
25 and 

g) M354T; 

and functional equivalents thereof. 

30 ''Functional equivalents" are to be understood as meaning in 

accordance with the invention mutants which exhibit, in at least 
one of the abovementioned sequence positions, an amino acid 
substitution other than the one mentioned specifically, but still 
lead to a mutant which, like the mutant which has been mentioned 

35 specifically, show an ''altered substrate profile" in accordance 
with the above definition over the wild type. Functional 
equivalence exists in particular also in the case where the 
alternations in the reactivity pattern correspond qualitatively. 

40 "Functional equivalents" naturally also encon^ass P450 

monooxygenase mutants which, like the P450 BM3 mutants which have 
been mentioned specifically, can be obtained by mutating P450 
enzymes from other organisms. For example, regions of homologous 
sequence regions can be identified by sequence comparison. 

45 Following the principles of what has been set out specifically in 
the invention, the modern methods of molecular modeling then 
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allow equivalent: muta-tions to be carried out which affect the 
reaction pattern., 

'^Functional equivalents" also encompass the mutants which can be 
5 obtained by one or more additional amino acid additions r 

substitutions, deletions and/or inversions, it being possible for 
the abovementioned additional alterations to occur in any 
sequence* position as long as they give rise to a mutant with an 
''altered substrate profile'' in the above sense « 

10 

The present invention furthezrmore relates to nucleic acid 
sequences encoding a mutated monooxygenase or a "functional 
equivalent" in accordance with the above definition. These 
sequences can preferably be obtained from SEQ ID NO: 1 by codon 
15 exchange in accordance with the above amino acid substitution 
patterns. 

The invention also encompasses those nucleic acid sequences which 
encompass so-called silent mutations or which are altered in 

20 comparison with a specifically mentioned sequence in accordance 
with the codon usage of a specific origin or host organism, and 
naturally occurring variants of such nucleic acid sequences. Also 
encompassed by the invention are deviations of the nucleic acid 
sequences obtained by the degeneracy of the genetic code (i.e. 

25 without altering the corresponding amino acid sequence) or by 
conservative nucleotide substitution (i.e. the corresponding 
amino acid is replaced by another amino acid with the same 
charge, size, polarity and/or solubility), and also sequences 
altered by the addition, insertion, inversion or deletion of 

30 nucleotides and which encode a monooxygenase according to the 
invention with an "altered substrate profile", and the 
corresponding complementary sequences. 

The invention furthermore relates to expression constructs 
35 comprising a nucleic acid sequence encoding a mutant according to 
the invention under the genetic control of regulatory nucleic 
acid sequences; and vectors encompassing at least one of these 
expression constructs. 

40 Preferably, the constructs according to the invention encompass a 
promoter 5 '•upstream and a terminator sequence 3 ' -downstream of 
the encoding sequence in question, and, if appropriate, other 
customary regulatory elements, in each case operatively linlced 
with the encoding sequence. Operative linkage is to be understood 

45 as meaning the sequential arrangement of promoter, encoding 
sequence, terminator and, if appropriate, other regulatory 
elements in such a manner that each of the regulatory elements 
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can fulfill lt:a intended function on expression of the encoding 
sequence. Examples of operatively linkable sequences are 
targeting sequences, or else translation enhancers, 
polyadenylation signals, selection xoaricers, amplification 
5 signals, replication origins and the like. 

In addition to the artificial regulatory sequences, the natural 
regulatory sequence can still be present upstream of the actual 
structural gene. Xf desired, this natural regulation may also be 

10 switched off by genetic alteration, and the expression of the 

genes may be enhanced or lowered. However, the gene construct may 
also be simpler in construction, i.e. no additional regulatory 
signals are inserted upstream of the structural gene and the 
natural promoter with its regulation is not removed. Xnstead, the 

15 natural regulatory sequence is mutated in such a way that 
regulation no longer takes place and the gene expression is 
increased or reduced. One or more copies of the nucleic acid 
sequences may be present in the gene construct. 

20 Examples of promoters ares cos, tac, trp, tet, trp-tet, Ipp, lac, 
Ipp-lac, laclq, T7, T5, T3 , gal, trc, ara, SP6, 1-PR or l-PL 
promoter, all of which are advantageously employed in 
Grcun- negative bacteria; and the Gram-positive promoters amy and 
SP02, the yeast promoters ADCl, Mra, AC, P-60, CYCl, GAPDQ or the 

25 plant promoters CaMV/35S, SSU, OCS, lib4, usp, STLSl, B33r nos, 
or the ubiquitin or phaseolin promoter. 

Xn principle, all natural promoters with their regulatory 
sequences can be used. Xn addition, synthetic promoters may also 
30 be used in an advantageous fashion. 

The abovementioned regulatory sequences are intended to allow the 
directed expression of the nucleic acid sequences. Depending on 
the host organism, this may mean, for exsunple, that the gene is 
35 expressed or overexpressed only after induction has taken placed, 
or that it is expressed and/ or overexpressed immediately. 

The regulatory sequences or factors can preferably have a 
positive effect on expression and in this manner increase or 
40 reduce the latter. Thus, an enhancement of the regulatory 

elements may advantageously take place at the transcriptional 
level by using strong transcription signals such as promoters 
and/or "enhancers", in addition, translation may also be enhanced 
by improving, for example, mRNA stability. 

45 
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An expression cassette is generated by fusing a suitable promoter 
with a suitable monooxygenase nucleotide sequence and a 
terminator signal or polyadenylation signal. To this end, 
customary recombination and cloning techniques are used as they 
5 are described, for excui^le, by T. Maniatis, E.F. Fritsch and J. 
Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1989) and by T.J. 
Silhavy, * M.L. Berman and L.w. Enquist, Experiments with Gene 
Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor^ NY 
10 (1984) and by Ausubel, F.M. et al.. Current Protocols in 
Molecular Biology, Greene Publishing Assoc. and Wiley 
Interscience (1987). 

For expression in a suitable host organism, the recombinant 
15 nucleic acid construct or gene construct is advantageously 

inserted into a host-specific vector which allows optimal gene 
expression in the host. Vectors sltb well known to the skilled 
worker and can be found, for example, in ''Cloning Vectors" 
(Pouwels P. H. et al., Ed., Elsevier, Amsterdam-New York-Oxford, 
20 1985). Vectors are to be understood as meaning not only plasmids, 
but all other vectors known to the skilled %iorker such as, for 
exan^le, phages, viruses such as SV40, CMV, baculovirus and 
adenovirus, transposons, IS elements, phasmids, cosmids, and. 
linear or circular DKA. These vectors can be replicated 
25 autonomously in the host organism or chromosomal ly. 

The vectors according to the invention allow the generation of 
recombinant microorganisms which can be transformed, for example, 
with at least one vector according to the invention and which can 

30 be employed for producing the mutants. The above-described 
recombinant constructs according to the invention can 
advantageously be introduced into a suitable host system and 
expressed. It is preferred to use usual cloning and transfection 
methods known to the skilled worker, for example coprecipitation , 

35 protoplast fusion, electroporation, retroviral transfection and 
the like, in order to bring about expression of the 
abovementioned nucleic acids in the expression system in 
question. Suitable' systems are described, for example, in Current 
Protocols in Molecular Biology, F. Ausubel et al., Ed., Wiley 

40 Interscience, New York 1997. 

SuiteJOle host organisms are, in principle, all organisms which 
allow expression of the nucleic acids according to the invention, 
their allelic variants, and their functional equivalents or 
45 derivatives. Host organisms are to be understood as meaning, for 
example, bacteria, fungi, yeasts or plant or animal cells. 
Preferred organisms are bacteria such as those of the genera 
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Escherichia such as^ for example, Escherichia coli, Streptomyces , 
Bacillus or Pseudomonas, eukaryotic microorganisms such as 
Saccharomyces cerevisiae, Aspergillus, and higher eukaryotic 
cells from animals or plants, for example Sf9 or CHO cells 

5 

If desired, expression of the gene product may also be brought 
about in transgenic organisms such as transgenic animals such as, 
in particular, mice, sheep, or transgenic plants. The transgenic 
organisms may also be knock-out animals or plants in which the 
10 corresponding endogenous gene has been eliminated, such as, for 
example, by mutation or partial or complete deletion. 

Successfully transformed organisms may be selected by means of 
marker genes which are also contained in the vector or in the 
15 expression cassette. Examples of such marker genes are genes for 
resistance to antibiotics and genes for enzymes which catalyze a 
color reaction causing the transformed cell to be colored. They 
may then be selected by means of automatic cell sorting. 
Microorganisms which have been transformed successfully with a 
20 vector and which carry a suitable gene for resistance to 

antibiotics (for exanqple G418 or hygromycin) may be selected by 
suitable liquid or solid media containing antibiotics. Marker 
proteins presented on the cell surface may be exploited for 
selection by means of affinity chromatography. 



25 



The combination of the host organisms and the vectors which match 
the organisms, such as plasmids, viruses or phages, for example 
plasmids with the RNA polymerase/promoter system, the phages X. ^ 
or other temperate phages or transposons and/or further 
30 advantageous regulatory sequences, forms an expression system. 
The term -expression system" is to be understood as meaning, for 
example, the combination of mammalian cells, such as CHO cells, 
and vectors, such as pcDNA3neo vector, which are suitable for 
mammalian cells. 



35 



40 



As described above, the gene product may advantageously also be 
expressed in transgenic animals, for example mice, sheep, or 
transgenic plants.- Equally, it is possible to program cell-free 
translation systems with the RNA derived from the nucleic acid. 

The invention relates furthermore to processes for the 
preparation of a monooxygenase according to the invention, in 
which a monooxygenase-producing microorganism is cultured^ 
monooxygenase expression is induced, if appropriate, and the 
45 monooxygenase is isolated from the culture, in this manner, the 
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monooxygenase according to the invention may also be produced on 
an industrial scale, if this is desired. 

The microorganism can be cultured and fezrmented by known methods. 
5 Bacteria, for example, can be multiplied in TB or LB medium at a 
temperature of 20 to 40^0 and a pH of 6 to 9. Suitable culture 
conditions are described in detail for example by T. Maniatis, 
E.F. Fritsch and J. Sambrook, Molecular Cloning i A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
10 (1989). 

Unless the monooxygenase is secreted into the culture medium, the 
cells are then interrupted, and the monooxygenase is obtained 
from the lysate by known protein isolation methods. The cells may 
15 be disrupted as desired by high-frequency ultrasound, by high 
pressure, such as, for example, in a French pressure cell, by 
osmolysis, by the action of detergents, lytic enzymes or organic 
solvents, by homogenizers, or by combining a plurality of the 
methods listed. 

20 

The monooxygenase may be purified by kno%#n methods o£ 
chromatography, such as molecular sieve chromatography (gel 
filtration), such as chromatography on Q-Sepharose, ion-exchange 
chromatography and hydrophobic chromatography, and by other 

25 customary methods such as ultrafiltration, crystallization, 

salting out, dialysis and native gel electrophoresis. Suitable 
methods are described, for exan^le, by Cooper, F.G., Biochemische 
Arbeitsmethoden [Methods in Biochemistry], Verlag Walter de 
Gruyter, Berlin, New York or by Scopes, R. , Protein Purification,. 

30 Springer Verlag, New York, Heidelberg, Berlin. 

To isolate the recombinant protein, it is especially advantageous 
to use vector systems or oligonucleotides which extend the cDNA 
by certain nucleotide sequences and thus encode altered 

35 polypeptides or fusion proteins for the purposes of simpler 
purification. Such modifications which are suitable are, for 
example, tags which act as anchors, for example the modification 
known as hexa-histidine anchor or epitopes which can be 
recognized as antigens of antibodies (described, for example, by 

40 Haxlow, E. and Lane, D., 1988, Antibodies: A X*aboratory Manual. 
Cold Spring Harbor (N.Y.) Press). These anchors can serve to 
attach the proteins to a solid support such as, for example, a 
polymer matrix, which may be filled into a chromatography column 
or used on a microtiter plate or on any other carrier, for 

45 example • 
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At the same time, these anchors may also be used for recognizing 
the proteins. To recognize the proteins, it is additionally 
possible to use conventional labels, such as fluorescent dyes, 
enzyme labels which, after reaction with a substrate, form a 
5 detectable reaction product, or radiolabels, alone or in 
combination with the anchors for derivatizing the proteins. 

The invention furthermore relates to biochemical processes for 
the enzymatic production of terminally or subterminally (co-l to 
10 u)-4) hydroxylated aliphatic carboxylic acids, which comprise 

al) aerobically culturing a recombinant microorganism according 
to the invention in the presence of a culture medium which 
contains at least one hydroxylatable carboxylic acid or at 
15 least one hydroxylatable carboxylic acid derivative; or 

a2) aerobically incubating a reaction medium containing at least 
one hydroxylatable carboxylic acid or at least one 
hydroxylatable carboxylic acid derivative with a mutant 
20 according to the invention, and 

b) isolating the resulting hydroxylated product or product 
mixture from the medium. 

25 The carboxylic acids can be employed in the process according to 
the invention per se or as derivatives thereof, such as, in 
particular, Ci-C4-alkyl esters or carboxamides . 

Hydroxylatable carboxylic acids which are preferably employed in 
30 the process according to the invention are Ca-Cao-monocarboxylic 
acids or derivatives thereof. 



A hydroxylatable carboxylic acid which is especially preferably 
used in the process according to the invention is a 
35 CB-Ci2-monocarboxylic acid or a derivative thereof, and a 

monooxygenase which is especially preferably used in the procesi 
according to the invention is a mutant in accordance with the 
above group (A) . 

40 In accordance with another process according to the invention, 
the hydroxylatable carboxylic acid used is a 
Ci2-C3o-monocarboxylic acid or a derivative thereof and the 
monooxygenase is a mutant selected from amongst the single 
mutants F87A, F87V, V26T, S72G, A74G and M354T, and the multiple 

45 mutants f) to i) in accordance with the above group (A). 
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The oxidation reaction according to the invention is usually 
carried out in the presence of atmospheric oxygen and in the 
presence of an electron donor (or reduction equivalent) such as, 
in particular, MADH, NADPH and Zn/Co(IXX) sepulchrate. The use 
5 of Zn/Co(III) sepulchrate is described, for example, in 
DE-A-199 35 115 which is herewith expressly referred to. 

If the hydroxylation according to the invention is carried out 
with a recombinant microorganism, it is preferred first to. 

10 culture the microorganisms in the presence of oxygen and in a 
complex medium such as, for example, TB or IiB medium, at a 
cultivation temperature of approximately 20 to 40«>C and a pH of 
approximately 6 to 9 until a sufficient cell density is reached. 
Exogenous substrate is added as required. In order better to 

15 control the oxidation reaction^ the use of an inducible, in 

particuleir a temperature-inducible, promoter is preferred. Here, 
the temperature is raised to the induction temperature required, 
for example 42*>C in the case of the Pr^i promoter, maintained over 
a sufficient period, for example 1 to 10 or 5 to 6 hours, to 

20 express the monooxygenase activity, and the temperature is then 
dropped to. a value of approximately 30 to 40^. Culturing is then 
continued for 12 hours to 3 days in the presence of oxygen. 

If, In contrast, the oxidation according to the invention is 
25 carried but with purified or concentrated enzyme, the enzyme 
mutant according to the invention is dissolved in a medium 
comprising exogenous substrate (approximately 0.01 to 10 mH, or 
0.05 to 5 mM), and the reaction is ceurried out, preferably in the 
presence of oxygen, at a temperature of approximately 10 to 50^, 
30 such as, for example, 30 to 40oc, and a pH of approximately 6 to 9 
(such as, for example, adjusted with 100 to 200 mM phosphate or 
Tfis buffer), and in the presence of a reducing agent, during 
which process the substrate-comprising medium furthermore may 
exhibit an approximately 1 to 100-fold molar excess of reduction 
35 equivalents, based on the substrate to be oxidized. If required, 
the reducing agent may be added portionwise. 

If required, oxygen may be passed into the reaction medium in a 
manner known per se. 

40 

If the processes according to the invention are carried out with 
enriched or purified enzyme, for example the following reaction 
conditions may be set: 



45 Substrate concentration: 
Enzyme concentration : 
Reaction temperature: 



0.1 to 20 mg/ml 
0.1 to 10 mg/ml 
20 to 40OC 
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pHs 6 to 8 

Buffer: 0.05 to 0.2 M potassium 

phos phate , Tr is / HCl 

Electron donor: is preferably added portionwise 

(initial concentration 
approx. 0.1 to 2 mg/ml) . 



Before the reaction is started by adding the electron donor, it 
is possible to increase the activity by adding acetone in a 
10 concentration of 1 to 5% (v/v) and briefly preincubating ( 1 to 5 
minutes at approximately 20 to 40oc) . 

The skilled worker may deviate from these conditions and optimize 
the reaction conditions in question by routine experiments. 

15 

Two different strategies exist in protein engineering for 
generating variants with modified enzymatic properties: firstly, 
the ''rational design** method (4), and secondly, the ''directed 
evolution" method (5-7). Of central importance for successful 

20 rational design are a highly resolved three-dimensional 
structural model and an in-depth knowledge of the enzyme 
mechanism. While the rational design has been shown to generate 
enzyme mutants which have a high activity for unnatural 
substrates ( 4 > , the effect of individual amino acid point 

25 substitutions on stability, activity and specificity of the 
mutants can frequently not be predicted. 

Even through X-ray structures of P450 BM-3 have been deposited in 
the protein database (8), the conformation of substrate and P450 

30 during the critical hydroxy lation stage remains unclear (9-11). 
Of central importance for the directed-evolution method is an 
efficient detection method for the rapid screening of large 
libraries of random mutants. An optical test for the P450 BM-3 
mutant F87A, which employes u)-para-nitrophenoxycarboxylic acids 

35 (pNCA), has proved to be useful (12, 13). The single mutant F87A 
shifts the hydroxylation position of fatty acids from position 
co-1, <a-2 and o-3 towards the <o-position (13) and additionally 
leads to a complete conversion of 12-pNCA compared with a 33% 
conversion observed for the wild-type enzyme (12). However, since 

40 the monooxygenase domain of P450 BM-3 encompasses over 400 amino 
acids, the screening of a random-mutant library is like looking 
for a needle in a haystack. 

To improve the efficiency of the searching procedure, the 
45 rational-design method was combined in accordance with the 
directed-evolution method so as to generate mutants with a 
modified reactivity, in particular for generating mutants with 
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l:he specificity for fa-t-by acids with medium chain length. The 
''rational evolution" method according to the invention is based 
on a computer-aided protein modeling to generate a virtual random 
library and to identify residues which are highly likely to 
5 affect the desired properties. A sublibrary is generated by 
randomizing these residues and screened for positive mutants. 
Starting from a structural model, the first step is the 
determination of residues which are potentially important for 
chain- length specificity. To generate mutants with improved 

10 properties, the mutation site proposed by the model is modified 
with the aid of the saturation mutagens. Then, the individual 
mutants with the best properties are combined with each other 
taking into consideration the rational design. The application of 
this combined strategy made it possible in poorticular to improve 

15 the activity of P450 BM-3 for substrates of medium chain length. 

The application of this strategy and the sequence homology to 
P450 enzymes of different origins allow the skilled worker 
analogously to generate other mutants which are also the subject 
20 of the invention . 

The present invention will now be illustrated by the following 
experiment description and taking into consideration the appended 
figures. The figures show: 

25 

Figure 1: the stepwise optimization of P450 BM-3 for the new 
substrate 8-pHCA. The catalytic efficiency over 8-pNCA is shown 
for each mutant (in the unit s-^M-^); and 

30 Figure 2: a model of the complex of the P450 BM-*3-mutant I«ARVF 
with the substrate 8-pNCA. The side chains which show the five 
best positions for mutations and which were combined to give the 
mutant IiARVF are shown (V26T, R47F, A74G, F87V, Ij188K) . 

35 MefchQd It Modeling the substrate/P45Q BM^3 CQnq;>lex 

The modeling of the substrate/enzyme complex was based on the 
structure of P450 EiM-3 which had been determined by 
crystallography, complexed with palmitoleic acid (9). This 

40 structure is deposited at the protein database (8). The crystal 
structure described, at a resolution of 2.9 A, contains four 
molecules in an asymmetric unit cell. Chain A was selected as 
reference for the modeled con^lex. A model of 8-pllCA was 
generated as substrate molecule using the SYBTL ''Molecule 

45 Builder" (Tripos, Inc., St. Louis, USA). The F87A mutation was 
generated using the S7BYL Biopolymer Tool. The carbon atoms 1 to 
4 of the substrate were placed into the binding site matching the 
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carbon atoms 6 to 9 o£ the bound palmitoleic acid. The torsion 
angle of the fatty acid chain was chosen in accordance with the 
transconfiguration step. NMR studies on P450 BM-3/laurate 
complexes and P450 BM-3/12-bromolaurate complexes demonstrated 
5 that the protons of the hydroxylated carbon atoms (CIO and Cll) 
are at an approximate distance of 3.0 A from the heme Iron, which 
is in the II oxidation state (10, 13). As a result of this 
finding, the corresponding atoms C7 and C8 of 8-pNCA were placed 
at a distance of 4 A and 3.6 A, respectively, from the heme iron- 
ic The para-nitrophenoxy group was arranged manually in the binding 
pocket. Also, the energy of the complex was minimized by fixed 
backbone atoms . 

Method 2; Saturation mutai^^n^ai 

15 

The mutants used in accordance with the invention were generated 
with the aid of saturation mutagenesis using the Stratagene 
QuikChange Kit (La Jolla, California, USA). Nine positions in the 
vicinity of the substrate binding channel, viz. P25, V26, R47, 
20 Y51, S72, A74, F87, 1.188 and M354, were selected for mutation via 
P450 BM-3 modeling. The primers for each of these positions are 
compiled in Table 1 • 

Table 1 : 



25 





Selec— 
ted 
posi- 
tions 


Primers 


Sequence No. 


30 


P25 


5' 


-9ttattaaacacagataaannngttcaagctttgatg-3 ' 


SEQ 


ID 


NO: 


9 




5' 


-catcaaagcttgaacnnntttatctgl:gtttaataac-3 ' 


SSQ 


ID 


NOt 


10 




V26 


5' 


-gttattaaacacagataaaccgnnncaa9ctttgatg-3 ' 


SEQ 


ID 


NO: 


11 






5' 


-catcaaagcttgnnncggtttatctgtgtttaataac-3 ' 


SEQ 


ID 


NO: 


12 


35 


R47 


b' 


-cgaggcgcctg9tnnn9taacgcgctacttatc-3 ' 


SEQ 


ZD 


NO: 


13 






5' 


-gataagtagcgcgttacnnnaccaggcgcctcg-3 ' 


SEQ 


ID 


NO: 


14 




Y51 


5' 


-cctggtC9tgtaacgcgcnnnttatcaagtcagc-3 ' 


SEQ 


ID 


NO: 


15 


40 




5' 


-gctgacttgataannngcgcgttacacgaccagg-3 ' 


SEQ 


ID 


NO: 


16 


S72 


5' 


-gctttgataaaaacttannncaagcgcttaaatttgtacg~3 ' 


SEQ 


ZD 


NO: 


17 






5' 


-cgtacaaatttaagcgcttgnnntaagtttttatcaaagc-3 ' 


SEQ 


ID 


NO: 


18 




A74 


b' 


-gctttgataaaaacttaaagtcaannncttaaatttgtac9-3' 


SEQ 


ID 


NO: 


7 


45 




5' 


--cgtacaaatttaagnnnttgacttaagtttttatcaaagc-3 ' 


SEQ 


ZD 


NO: 


8 
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5 



L188 


5 ' -9aa9caatgaacaagnnncagc9a9caaatccag->3 ' 

5 '-C'tgga^'t1:gc'tC9Ct:gnnnc'tt:gttca'tt:9Ct.'bc-3 ' 


SEQ ID NO: 5 
SEQ ID NOt 6 


H354 


5 ' -ggcgacgaac'tannngt.'hc^ga'b'kcc'tcag-B ' 
5 ' -C'tgaggaateagaacnnnliag^t.cg'tcgcc-S ' 


SEQ ID NO 2 19 
SEQ ID NO: 20 


F87 

■ 


5 ' -gcaggagacggggttgnnnacaagctggacg-S ' 
5 ' -C9tccagctt.gi:nnncaaccccgtctcctgc->3 ' 


SEQ ID NOs 3 
SEQ ID NO: 4 



10 

The reaction conditions were identical for all mutagenic PCR 
processes, with the exception of the annealing temperature, which 
was varied as follows: SO^C for positions 25, 26, 188 and 354; 
52^0 for positions 47 and 51; and 46^ for positions 72, 74 and 

15 87. The reactions were carried out in reaction volumes of 50 ^1, 
each batch containing 17.5 pmol of each primer, 20 pmol of the 
template plasmid DNA, 3 U of the Pfu polymerase and 3.25 nmol of 
each dNTP. The reaction was started at 95^C, 4 minutes, and the 
batch then undergoes the following thermocycle 20 times: 95%, 

20 1 minute; 46-52<=h:, 2.5 minutes; 72%, 17 minutes; after these 

20 cycles, the reaction was continued for 15 minutes at 72% • To 
carry out a site-specific mutagenesis by PCR, an individual codon 
was altered for exchanging one amino acid. To randomize a 
specific amino acid, primers were used in which '^nnn" encodes the 

25 specific amino acid. All PCR product solutions were treated for 
3 hours at 37% with 20 U Dpnl to. digest the original unmutated 
template DNA. Transformation into E. CQli DH5a was then carried 
out. 

30 Method 3: Expression and purification of w lld^type enzyme and of 

the mutants 

The P450 BM-3 wild- type gene and its mutants wee expressed under 
the control of the strong temperature- inducible PrPi, promoter 

35 pCYTEXPl into E. coli strain DH5o (supE44, lacU169 [SOlacZ M15] 
h5dR17 recAl endAl gyrA96 thi-1 relAl) . The single point mutation 
F87A was introduced in accordance with the prior art (12). The 
transformed cells were plated onto LB agar plates which contained 
100 |Ag/ml ampicillin. After the colonies had grown for 12 to 

40 24 hours, they were picked up with sterile toothpicks and placed 
into 96-well microtiter plates, each well containing 200 \l1 of TB 
medium (12 g tryptophan, 24 g yeast extract, 4 ml glycerol, 
(distilled) H2O to 1 liter) together with 100 ^g/ml ampicillin. 
The plates were incubated overnight at 37^C. 40 were 

45 subsequently removed from each well and transferred into culture 
tubes which contained 2 ml TB medium with 100 |xg/ml ampicillin^ 
and the batches were subsequently incubated for 2 hours at 37^C 
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and t:hen for 6 hours at 42^C« The cells were removed by 
cen-trifugation for 5 minutes at 4000 rpm, treated with chicken 
albumen lysozyme (1 U/ml), and then twice frozen and defrosted. 
The crude cell extracts were obtained by centrif ugation for 
5 10 minutes at 14,000 rpm. The activity was measured in the 

resulting supernatant. To produce large quantities of enzyme, a 
2-1 shake flask with 300 ml of TB medium with 100 {ig/ml 
ampicillin was used, incubated at 37*^, shaken for 2 hours at 
200 rpm (OD57Bnjn ^-8 to 1.0) and subsequently incubated for 

10 6 hours at 42<^C- The cells were collected by centrif ugation for 
10 minutes at 4000 rpm and suspended in 15 ml of 0.1 M potassium 
phosphate buffer, pH 7.4. The ice-cooled suspension was disrupted 
with the aid of a Branson sonifier W25 ( Dietzenbach, Germany) 
(80 W, 2 minutes, 3 cycles). The suspension was centrif uged for 

15 20 minutes at 32 570 x g. The crude extracts were used for 
determining the enzyme activity or for enzyme purification* 

Enzyme purification was carried out as described in (14), but 
using a BioPilot chromatography (Pharmacia, Sweden) » Enzyme 

20 purity was determined by determining the total protein and the 
amount of enzyme . The concentration of purified enzyme was 
determined from the difference between the absorption spectrum of 
the carbonyl complex of the iron(II) form in comparison with the 
iron (XI) form using a molecular absorptivity of 91 mM--^ cm^^ for 

25 the wavelength pair 450 nm and 490 nm (1)« 

Method 4; Igolation of mutants with higher activity for 

substrates of shorter chain length 

30 Xnstead of the wild type, the P450 BM-3 mutant F87A was used as 
template DNA. The mutations for the position in question were 
generated as described above. In each case approximately 100 
colonies were picked for screening the sequence zone at each 
position and grown in culture tubes, and cells were isolated 

35 therefrom and lysed. The crude cell extracts from each selected 
colony were used for the activity test. All mutants which showed 
a higher activity than F87A for at least one substrate with a 
shorter chain length than 15-pNCA was sequenced for identifying 
the mutations. 

40 

The mutant with the highest activity for 12-pNCA, 10-pKCA or 
8-pNCA, respectively, amongst all mutants for the same position 
was selected for a later combination with other mutations « The 
combined mutation was carried out stepwise by site-specific 
45 mutagenesis. The combinatory procedure is shown in Figure 1. Six 
colonies were isolated from each combination step to determine 
the substrate specificity- A colony with a representative 
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Bub8l:rate speci£lcl1:y was selected, and the substrate specificity 
was determined for the pure enzyme . The plasmid of the selected 
colony was used for the next step of the site-specific 
mutagenesis. The mutations in the final mutant were identified by 
5 DNA sequencing (ABI PRISM^ BigDye™ Terminator Cycle Sequencing 
Ready Reaction Kit and ABI Prism™ 377 DNA Sequencer) . 

Mp>t-,hrid 5 * Engvme activity assav 

10 For the pNCA activity assay use was made of 8 jjil of a 10 mM pNCA 
solution in DMSO in a disposable cell in an end volume of 1 ml. 
After 850 |xl of Tris/HCl buffer (0.1 pH 8.2) and 0.1 to 
0.4 nmol F450 had been added to the pNCA/DMSO solution in 
question, the samples were preincubated for 5 minutes before the 

15 reaction was started by adding 50 ^1 of an aqueous solution of 
1 irVN NADPH. To determine Kcat and Km, a concentration series was 
established for the various pNCA substrates (regarding details of 
the detection method, see (12)). 

20 A 96-well plate (Greiner, Frickenhausen, Germany) was used for 
carrying out the pNCA test in a microtiter plate. In a total 
reaction volume of 250 ixl in Tris/HCl buffer (0.1 M, pH 8.2), the 
reaction batch contained either 60 nmol 8-pNCA, 15 nmol 10-pNCA, 
15 nmol 12-pNCA or 15 nmol 15-pMCA, in each case dissolved in 

25 2.5 ^1 of DMSO. After the samples had been preincubated for 
5 minutes with 40 |Jil of P450 BH-3 san^les, the reaction was 
started by injecting, into each well, 30 |il of a 1 xnM HADPH 
solution. Immediately after the NADPH solution had been added, 
the plates were measured in a plate reader at 405 nm. 

30 

MeiihQd 6g Conversion of ca rboxyllc acids using mutants according 
a) Chemical reaction 

35 

The following batch was chosen to hydroxylate the fatty acids in 
the presence of P450 BM-3 or its mutants: 

P450 BM-3 mutant 20 mg 

40 Reaction buffer 20 ml (TriB/HCl 50mM, KCl 250mM, pH 7.8) 

Fatty acid 10 mg 

Acetone 400 |jil (2 % v/v) (accelerates the 

reaction by a factor of 2) 



45 
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Prior to the reaction, the enzyme lyophilisate was dissolved in 
1 ml of reaction buffer and first incubated for 30 minutes at 
36^C. Like the addition of 2% v/v acetone, the incubation leads to 
an increase in activity of 60 and 75%, respectively. 

5 

After 5 minutes' incubation, 500 of the previously prepared 
NADPH solution (12.5 mg/ml) were added. The course of the 
reaction is monitored by absorption measurements at 340 nm, and 
the consumption of NADPH can be observed. Theoretically 4 ml of 
10 NADPH solution would be required for a stoichiometric conversion, 
however, an unduly high NADPH concentration in the reaction 
solution results in inactivation of the enzyme, which is why the 
cof actor is added in 500 h*1 steps. 

15 After the reaction had ended, the solution was acidified with 5H 
hydrochloric acid to a pH of 2 . The fatty acids or hydroxylated 
fatty acids precipitate during this and result in a noticeable 
cloudiness of the solution. Thereafter, the mixture was extracted 
twice with in each case 10 ml of dichloromethane and dried over 

20 sodium sulfate. After filtration through a folded filter, the 
dichloromethane was removed on a rotary evaporator or by 
evaporation with nitrogen. The white solid which remained was 
taken up in 2 ml of dichloromethane and used in the GC analysis • 

25 b) Analysis by gas chromatography 

A Fisons gas chromatograph (Fisons Instruments Mega Series, 
Mainz, Germany) equipped with FID was used for the analysis 
(Optima 5 -column, internal diameter 25 m x 0.25 iran, Macherey & 
30 Nagel, Diiren, Germany). 

For analysis, starting materials and products were silylated with 
MSHFBA* During this, all hydroxyl and acid groups are converted 
into the corresponding trimethylsilyl ethers and esters, 
35 respectively. Care must be taken that the 8€un:iple is anhydrous 
since otherwise side reactions with the silylating agent result. 
15 |Jil of MSHFBA were pipetted into 10 (il of the 

hydroxy* f at ty-acid*containing dichloromethane solution and the 
mixture was incubated for 15 minutes at room temperature. After 
40 25 fil of dichloromethane had been added, the GC analysis was 
carried out. 

To this end, 1 \xl of the silylated samples were injected into the 
gas chromatograph. Injector and detector temperature were 350°C. 
45 The following temperature programmes were used: 
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Caprylic acid 
100 (2 min) 



5°/inin 



200 OC 



10<=^/niin 



^ 300 oc (5 min) 



g Capric acid 

100 oc (2 min) 5o/min ^ oc (5 min) 1S1^^^30Q oc (5 min) 



10 



Laurie 

100 oc (2 min) 5o/min ^ ^ (5 min) 10^/"*^^^ 300 oc (5 min) 



Example 1 = Seleci:iQn of the starting mutant 

The P450 BM-3 single mutant F87A was used as starting template. 
As already mentioned, this mutation shifts the hydroxylation 
position in saturated C12* and C14-£atty acids of (o-2 and 

03-3 towards the co-position (13). Tor the conversion of 12-pNCA 
and 15-pNCA/ it was also shown that the co-hydroxylation is 
20 enhanced significantly over the wild type (12)* As regards the 
(D-hydroxylation of 12-pNCAr a complete conversion was obtained 
compared with a 33% conversion in the case of the wild type. In 
addition to the increased regioselectivity, increased activities 
for 12-pNCA and 15-pHCA were also observed (compare Table 2). 

Table 2: Specific activity of selected P450 BM-3 mutants for 
various pNCA derivatives with different chain lengths^ 



30 



Sub- 
strate 


WT 


F87A 


L188K 


V26T 


R47F 


S72G 


A74G 


M354T 


15-pNCA 


405 


410 


288 


519 


258 


439 


474 


560 


12-pNCA 


141 


284 


316 


555 


233 


596 


517 


480 


10-pNCA 


339 


92 


207 


106 


52 


150 


103 


171 


8-pNCA 


15 


2 


69 


16 


13 


3 


6 


4 



1 Units of specific activity s nmol/min/nmol P450 

The increased regioselectivity of F87A can be understood with 
reference to the structural model* The substitution of the bul)cy 
residue F87 by alanine enlarges the binding pocket for the 
p-nitrophenoxy group formed by F87, and also by V78, A82, T260, 
1263 and A264. In addition, the access of the large 
p-nitrophenoxy group to the binding pocket is facilitated and the 
steric interactions between F87 and the carbon atoms co-2 and <o-l 
of the pNCA are eliminated. This allows a more advantageous 
orientation of the co-position relative to the heme iron atom. 
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Example 2; Selection of individual mutiatiion pogitions by 
modeling, site-specific randomizatiion mutiagenegia of selected 

positions > and screening 
5 1. Selection of the mutation positions 

The model of the bound CS-pNCA substrate shows that the 

carboxyl£ite is at a distance of 9 and 11 k from the 

carboxy late-binding residues R47 and Y51, respectively. To induce 

10 activity for C8 substrates, it is necessary to generate a new 

binding site which is at a suitable distance from the carboxylate 
group of the substrate. Additional residues should be positioned 
within the binding site to facilitate the binding of S-pNCA. The 
following residues were selected: the residues R47 and Y51, which 

15 form the original carboxylate binding site. For R47, it is 

proposed to form an ion pair between its guanidinium group and 
the carboxylate residue of the substrate; YSl forms a hydrogen 
bridge with the carboxylate group of the substrate (9). In the 
palmitoleic acid/P450 BM-3 complex, the distance between the 

20 atom of R47 and the C atom of the fatty acid carboxylate group is 
12 A. All residues in the hemisphere with a 12 A radius around 
the carboxylate group of the 8-pNCA model were identified, and 
only those were selected within the binding pocket which have 
side chains pointing towards the carboxylate group of the fatty 

25 acid. In this manner, P25, V26, S72, A74, L188 and M354 were 
selected, and these are probably capable of forming a new 
carboxylate binding site for short-chain pNCA compounds. These 
six selected residues are located on secondary structural 
elements with a flexible conformation (11). 

30 

2. Mutation and screening 

Mutants of each of the eight selected residues were generated by 
site-specific randomization of the wild-type codon at the 

35 positions in question. To ensure that most of the 19 possible 

amino acid species are tested, 100 colonies of each position were 
isolated, cultured and tested for activity (the probability that 
each amino acid is tested is thus greater than 95%, with the 
exception of a probability of 79% for tryptophan and methionin) . 

40 The mutants with a higher activity value for at least one 

substrate with a shorter chain length than 15-pNCA were selected 
for sequencing for identifying the mutation(s). 

It emerged that position 188 is relatively variable. Most of the 
45 100 colonies showed activity for pNCA. From these, 37 colonies 
were selected in accordance with their activity for 8-pl9CA. 16 
different amino acid types inclusive of the wild type were 
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de-tected. This result furthermore confirmed that the selection of 
100 colonies was sufficient to ensure that most of the 20 amino 
acids which are possible can be tested. Amongst the 15 
substitutions, substitutions R, W, G, A and S 

5 significantly increased the catalytic activity for substrates of 
shorter chain lengths . The substitution of L by negatively 
charged amino acids led to an activity for 10-pNCA, l2-pNCA and 
15-pNCA which was reduced by a factor of three to seven. 

10 The remaining seven positions were randomized in a manner similar 
to position 188. 7 to 19 colonies were selected from each 
position for DNA sequencing for detecting the mutation(8). Most 
of these genes were either unmutated, or else the gene, product 
showed reduced activity for substrates of shorter chain lengths 

15 (the tests were carried out with pure enzyme). In each case only 
one mutant of positions 26, 47, 74 and 354, two mutants of 
position 72 and no mutants of positions P25 and YSl showed a 
higher activity for substrates of shorter chain length in 
comparison with 15-pNCA, The mutant with the highest activity for 

20 8-pNCA in position 188 and position 72, and also single mutants 
of positions 26, 47, 74 and 354 were selected for a combination. 
These mutations, which are listed in the eibove Table 2, contain 
the following substitutions: V26T, R47F, S72G, A74G, L188K and 
M354T. In comparison with F87A, the specific activity for 8-pNCA 

25 was increased by the factor 0.5 to 33.5. Each of these mutations 
was combined stepwise with the original mutation F87A. 

Rvample 2= M u H-iple mutants generated bv steDwise combination Of 

mutants , and their screening 

30 

1. Mutant I. (F87A, L188K) 

The selection of the first mutant is critical since the 
combination of single mutants does not necessarily result in a 

35 cumulation of the individual effects. The wrong selection of the 
first mutation might lead towards an evolutionary pathway which 
does not result in an optimal multiple mutation. In the case of e 
large starting pool of single mutants, an enormous combinatory 
diversity is likely and, as a consequence, finding the desired 

40 multiple mutant is unlikely. 

Two criteria favor the selection of mutant Ii as starting 
material: its increased activity for 8-pNCA compared with all 
other mutants (Table 2), and the changes in the enzyme/ substrate 
45 interaction and the properties of the binding site. To verify 
these changes, structural models of the selected mutants were 
generated by replacing the side chains of the mutation positions 
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with the aid of the SYBYL method. L188 is located on the 
C terminus of the a-helix F. This helix and the vicinal G helices 
experience the greatest shift by substrate binding (9). The model 
illustrates that the exchange of leucine for lysine leads to the 
5 formation of a novel putative carboxylate binding site. The 
distance between the amino group of K188 with the carboxylate 
residue of 8-pNCA is 6 A and thus comparable with the distance of 
the carbbxylate group of palmitoleic acid and of the guanidinium 
group from R47 (the original binding residue), which value has 

10 been determined experimentally. The distance between the 

guanidinium group of R4 7 and the carboxylate group of S-pNCA is 
11-4 A. It can therefore be proposed that the ion-pair reactions 
between the carboxylate group of the substrate and K188 together 
with the hydroxylated C8 atom take place in a reactive distance 

15 to the heme iron atom. 

The other five mutants were combined stepwise with mutant I. by 
site-specific mutagenesis (compare Figure 1). A structural model 
of the mutants selected was generated for each mutagenesis step. 
20 The effects, of the mutants generated, on substrate binding was 
studied with reference to this model, and, on this basis, a 
rational explanation for the optimized properties of the mutants 
and their combinations was proposed. 

25 2. Mutant LA (F87A, L188K, A746) 

Catalytic efficiency over 8-pNCA was increased by introducing 
mutation A74G into mutant L. A74 is located on the N- terminus of 
the a-helix B'. Since the side chain of A74 sterically interacts 
30 with the amino group of K188, this interaction is eliminated by 
exchanging A74 for glycine, thus allowing a more advantageous 
orientation of the side chain relative to the carboxylate group 
of the substrate. 

35 3. Mutant lAR {F87A, I1I88K, A74G, R47F) 

The catalytic efficiency for 8-pNCA was successfully improved by 
introducing the additional mutation R4 7F. The mutation R47F had 
two possible effects. Phenylalanine hinders the original 
40 carboxylate binding and enlarges the hydrophobic section, which 
is exposed to the solvent, at the entrance of the binding channel 
formed by the residues Fll, L14, L17, PIS, P45 and A191 (compare 
Figure 2). This hydrophobic section is particularly important for 
attaching the substrate (11). 
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4. Mutant LARV <F87A, L188K, A74G, R47Fr V26T) 

If the mutation M354T is added to mutant lAR, the Kcat value for 
8-pCNA is reduced. M354T was therefore not involved in further 
5 combination experiments. However, if the mutation V26T is added 
to mutant lARr the resulting mutant LARV shows a slightly higher 
Kcat value and a slightly lower Km value than UVR. The catalytic 
efficiency for 8-pNCA was increased. If r instead, mutation S72G 
is carried out in mutant LAR, the Kcat value of B-pNCA was 
10 reduced. This is why mutant LARV was selected for further 
mutations • 

V26 is located at the N-terminus of the a-helix A* The model 
(Suggests that T26 might assume the role of the original residue 

15 Y51 of the binding site and might stabilize the carboxylate group 
of the substrate by forming a hydrogen bridge. The distance 
between the hydroxyl group of T26 and the amino group of K188 is 
6.9 A. Compared with the original • binding site, this distance is 
approximately 2 k greater (4.8 A between the hydroxyl group of 

20 YSl and the guanidinium group of R47). It is therefore possible 
that the K188-containing F helix is subjected to further 
conformational alterations, owing to which 8-pNCA is retained 
deeper down in the binding region, and the distance between T26 
and K188 is reduced. 

25 

5^ Mutant LARVF (F87V, LlSSK, A746, R47F, V26T) 

It was possdLble to demonstrate that position 87 is of particular 
importance for the catalytic activity and the substrate 
30 specificity of the enzyme (3, 13). This is why mutant lARV was 
optimized in position 87 by site-specific randomized mutagenesis. 
Amongst 100 colonies, one mutant was obtained which exhibited a 
catalytic efficiency of 3.5*10* s-^ M-^ for 8-pNCA. The DNA 
sequence data revealed that alanine in position 87 was replaced 

35 by valine. 

It is known from the crystal structure (11) and earlier 
experiments (3, 13) that R87 is important for the access of the 
substrate to the heme iron. The bulky para-nitrophenyl group of 
40 this substance made it necessary to increase the size of the 
binding site by exchanging phenylalanine for alanine. By 
exchanging AB7 for valine, the contact of Co, with the heme iron 
atom is improved, owing to steric interactions between the ether 
oxygen and the V87 side chain. 
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The data reveal that two key steps, F87A to L and LARV to liARVF, 
increased the catalytic efficiency for 8-pNCA, This raised the 
question as to whether the other mutations in mutant LARVF can be 
reversed without losing the activity for 8-pNCA. Mutants were 
5 therefore generated which contain mutation F87V instead of 
mutation F87A. These mutants have the following names: 

1. ) L7V with the mutations F87V LISSK 

2. ) AL7V with the mutations F87V L188K A74G 

10 3. ) ARI*7V with the mutations F87V L188K A74G R47F 

The following results were obtained in the activity assays. 

Table 3: Kinetic parameters of P450 BM-3 mutants for pNCA 

15 derivatives of different chain lengths, determined at pH 8-0 and 
25c 



20 





Kc 


at (s"i) 


Km (^) 


Kcat/Km (s-iM-l) 




12 


10 


8 


12 


10 


8 


12 


10 ' 


8 


F87V 


0.7 


1.8 




3.8 


8.5 




1.7«105 


2.1-105 




L7V 


2.8 


4.7 




6.4 


14.2 




4.3-105 


3-3-105 




AL7V 


2.2 


5.9 


4.3 


15.1 


22.7 


197.6 


1.4* 105 


2.6-105 


2.0-10* 


ARL7V 


1.4 


5.5 


3.9 


8.9 


17.5 


41.4 


1.7*105 


3.1-105 


9.3-10* 


ARVLF 


1.4 


7.2 


0.2 


12.3 


44.8 


6.5 


9.2-10* 


1.6-105 


3.5-10* 



conversion for the substrate 8-pNCA. The results furthermore show 
that the quadruple mutant ARL7V shows an even better catalytic 
efficiency than the quintuple mutant ARVLF. 



30 



Moreover, mutants ARL7V and ARVLF show high activities for capric 
acid (CIO), which is shorter by two carbon atoms than the 
shortest native fatty acid substrate of the wild-type enzyme. 
Moreover, co-4-monohydroxylated products were observed when lauric 
35 acid (C12) is used as substrate, while the wild-type enzyme of 
P450 BM-3 is only active in the positions co-l,(o-2and qj-3. 

EXWBPle 4 s Determination of the or-^^f^^T-r-^rl hydrnvY t atlon pniciit ^ i^n 
for CarbPXVliC acida of different chAin l e notha and vairinii« 
40 mutants, and comparia on with the wild- type enzyme 

The reaction batches are worked up and analyzed in accordance 
with the above-described method (6). 



45 
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The results are coiapiied in Table 4 below 



Table 4 : 



• 

1 Hydroxy 1 at ion 
positions 


Capric acid (CIO) 


Laurie acid (C12) 

• 








1 Wild type 




• 


1 (0-3 


- 


34 




- 


28 


1 €0-1 


- 


38 


1 Mutant I*7V 


• 




1 CD-3 


- 


53 


1 co-2 


- 


30 


1 co-1 




17 


1 Mutant F87V 




• 


j fO-3 


- 


51 




- 


25 


1 CO-1 


— 


24 


1 Mutant AL7V 




- 


j co-3 




28 


1 co-2 




54 


1 co-1 




18 


1 Mutant ARI.7V 






1 ^o-3 


14 


35 




33 


50 


1 oj-l 


53 


15 


I Mutant ARVI*F 






1 co-3 


15 


34 


1 co-2 


30 


53 


1 co-1 


55 


13 



10 



15 



20 



30 



35 



What is noticeable in the hydroxylation of lauric acid is that a 
change in regioselectivity first takes place during the 
transition from the wild type to P450 BM-3 F87V^ which 
40 preferentially catalyzes an co-3 hydroxylation, as is the case with 
P450 BM-3 Ii7V. During the transition to the triple mutant, the 
regioselectivity changes again; P450 BM-3 AIi7V, P450 BM-3 ARI.7V 
and P450 BM-3 ARVLF preferentially direct the hydroxylation to 
the co-2 position. 

45 
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All GC analyses demonstrate that capric acid is only converted by 
P450 BM-3 ARVUP and P450 BM-3 ARL7V, while a conversion of less 
than 1% is found in the other mutants. Both enzymes exhibit a 
virtually identical regioselectivity^ with the ca-1 position being 
5 preferred. 

To determine the reaction yields, gas chromatograms of standards 
were recorded. The starting material standard used was capric or 
lauric acid solution with dichloromethane as solvent 

10 (concentration 0.5 mg/ml). Owing to the FID detection method, it 
was not possible to simply compare the peak areas of starting 
materials and products with one another since molecules which 
have different structural and empirical formulae generate 
different ion fluxes upon combustion. These ion fluxes are not 

15 proportional to the quantitative ratio of starting material and 
product. The product standard used was therefore commercially 
available 10-hydroxycapric acid or 12-lauric acid- The empirical 
formulae of the standards and of the products are identical, and 
the structure only differs with regard to the position of the 

20 hydroxy 1 group, which is why approximately identical detectable 
ion fluxes can be assumed for identical quantities. 

In the hydroxylations of capric acid with P450 BM-3 ARVLF and 
P450 BM-3 ARXt7V catalysis, the cumulative product yields were 57% 
25 and 38%, respectively. In the hydroxylations of lauric acid, the 
yield was 51% for P450 BM-3 ARVLF catalysis and between 38% and 
40% for all other mutants and for the wild type. 



30 
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40 
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SEQUENCE LISTING 

<110> BASF Aktiengesellschaft 

<120> Modified cytochrome P450 monooxygenases 

<130> H/40434 

<140> 
<141> 

■ 

<160> 20 

<170> Patent In Ver. 2.1 

<210> 1 
<211> 3150 
<212> DNA 

<213> Bacillus megateriuni 

<220> 

<221> CDS 

<222> (4). .(3150) 

<400> 1 

atg aca att aaa gaa atg cct cag oca aaa acg ttt gga gag ctt aaa 48 
Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu I«eu Lys 
15 10 15 

aat tta ccg tta tta aac aca gat aaa ccg gtt caa get ttg atg aaa 96 
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 

20 25 30 

att gcg gat gaa tta gga gaa ate ttt aaa ttc gag gcg cct ggt cgt 144 
lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 

35 40 45 

gta acg cgc tac tta tea agt cag cgt eta att aaa gaa gca tge gat 192 
Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp 

50 55 60 

gaa tea cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 240 
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg 
65 70 75 

gat ttt gca gga gac ggg tta ttt aca age tgg acg eat gaa aaa aat 288 
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 
80 85 90 95 

tgg aaa aaa gcg cat aat ate tta ctt cea age ttc agt cag cag gca 336 
Trp Lys Lys Ala His Asn Xie Leu Leu Pro Ser Phe Ser Gin Gin Ala 

100 105 110 

atg aaa ggc tat cat gcg atg atg gtc gat ate gee gtg cag ctt gtt 384 
Met Lys Gly Tyr His Ala Met Met Val Asp lie Ala Val Gin Leu Val 

115 120 125 

caa aag tgg gag cgt eta aat gca gat gag cat att gaa gta ccg gaa 432 
Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu 
130 135 140 

gac atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac 480 
Asp Met Thr Arg Leu Thr Leu Asp Thr lie Gly Leu Cys Gly Phe Asn 
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145 



150 



155 



tat cgc ttt aac age ttt tac cga gat cag cct cat cca ttt att aca 528 
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr 
160 165 170 175 

agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga goa 576 
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 

180 185 190 

aat cca gac gac cca get tat gat gaa aac aag cgc cag ttt caa gaa 624 
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 

195 200 205 

gat ate aag gtg atg aac gac eta gta gat aaa att att gca gat cgc 672 
Asp He Lys Val Met Asn Asp Leu Val. Asp Lys He He Ala Asp Arg 
210 215 220 

aaa gca age ggt gaa caa age gat gat tta tta acg cat atg eta aac 720 
Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 
225 230 235 

gga aaa gat cca gaa acg ggt gag ceg ett gat gac gag aac att cgc 768 
Gly i.ys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Xle Arg 
240 245 250 255 



tat caa att 
Tyr Gin He 



aca ttc tta att geg gga eac gaa aca aca agt ggt 
Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly 
260 265 270 



816 



ctt tta tea ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 

275 280 285 



864 



gca gaa gaa gca gca cga gtt eta gta gat cct gtt 
Gin Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val 
290 295 300 

age tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta 
Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu 
305 310 315 

gaa gcg ctg cgc tta tgg cca act get cct gcg ttt tec eta tat gca 
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 
320 325 330 335 

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 

340 345 350 

gaa eta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg 
Glu Leu Met Val Leu He Pro Gin Leu His Arg Asp Lys Thr He Trp 

355 360 365 



cca 912 



aac 960 



1008 



1056 



1104 



gga gac gat gtg gaa gag ttc cgt 
Gly Asp Asp Val Glu Glu Phe Arg 
370 375 

gcg att ceg cag cat gcg ttt aaa 
Ala He Pro Gin His Ala Phe Lys 
385 390 



gag cgt ttt gaa aat 
Glu Arg Phe Glu Asn 

380 

ttt gga aac ggt cag 
Phe Gly Asn Gly Gin 
395 



cca agt 1152 



cgt gcg 1200 
Arg Ala 



tgt ate ggt cag cag ttc get ctt eat gaa gca acg ctg gta ctt ggt 1248 
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Cys lie Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 
400 405 410 415 

atg atg eta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg 
Mot Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu 

420 425 430 

gat att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa 
Asp lie Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 

435 440 445 

gca aaa teg aaa aaa att ccg ctt ggc ggt att cct tea cct age act 
Ala Lys Ser Lys Lys lie Pro Leu Gly Gly He Pro Ser Pro Ser Thr 
450 455 460 

gaa cag tct get aaa aaa gta cgc aaa aag gca gaa aac get cat aat 
Glu Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn 
465 470 475 

acg ccg ctg ctt gtg eta tac ggt tea aat atg gga aea get gaa gga 
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 
480 485 490 495 

acg gcg cgt gat tta gca gat att gca atg age aaa gga ttt gca ccg 
Thr Ala Arg Asp Leu Ala Asp He Ala Met Ser Lys Gly Phe Ala Pro 

500 505 510 



cag gtc gca acg ctt gat tea cac gcc gga 
Gin Val Ala Thr Leu Asp Ser His Ala Gly 

515 520 



get gta tta att gta acg gcg tct tat aac ggt 
Ala Val Leu He Val Thr Ala Ser Tyr Asn Gly 
530 535 



ctt ccg cgc gaa gga 
Leu Pro Arg Glu Gly 
525 



ccg cct gat aac 
Pro Pro Asp Asn 
540 



gca aag caa ttt gtc gac tgg tta gac caa gcg tct get gat gaa gta 
Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 

aaa ggc gtt cgc tac tec gta ttt gga tgc ggc gat aaa aac tgg get 
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 
560 565 570 575 

act acg tat caa aaa gtg cct get ttt ate gat gaa acg ctt gcc get 
Thr Thr Tyr Gin Lys Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala 

580 585 590 

aaa ggg gca gaa aac ate get gac cgc ggt gaa gca gat gca age gac 
Lys Gly Ala Glu Asn He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 

595 600 605 

gac ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac 
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
6X0 615 620 

gta gca gcc tac ttt aac etc gac att gaa aac agt gaa gat aat aaa 
Val Ala Ala Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lvs 
625 630 635 

tct act ctt tea ctt caa ttt gtc gac age gee gcg gat atg ccg ctt 
Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 
640 645 650 655 

gcg aaa atg cac ggt gcg ttt tea acg aac gtc gta gca age aaa gaa 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 
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Ala I-ys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 

660 665 670 

ctt caa cag cca ggc agt gca cga age acg cga cat ctt gaa att gaa 
Leu Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu lie Glu 

675 680 685 



ctt 
Leu 



aaa gaa get tct tat caa gaa gga gat cat tta ggt gtt att 
Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He 
690 695 700 



cot cgc aac tat gaa gga ata gta aae egt gta aea gea agg tte ggc 
Pro Arg Asn Tyr Glu Gly He Val Asn Arg Val Thr Ala Arg Phe Gly 
705 710 715 



eta gat gca tea eag caa ate 
Leu Asp Ala Ser Gin Gin tie 
720 725 



cgt ctg gaa gca gaa gaa gaa aaa tta 
Arg Leu Glu Ala Glu Glu Glu Lys Leu 

730 735 



get cat ttg 
Ala His Leu 



etc get aaa aca gta tec gta gaa gag ctt ctg caa 
Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin 
740 745 750 



tac gtg gag ctt caa gat 
Tyr yal Glu Leu Gin Asp 

755 



gtt acg cgc acg cag ctt cgc gca atg 
Val Thr Arg Thr Gin Leu Arg Ala Met 
760 765 



get get aaa acg gtc tgc ccg ceg cat aaa gta gag ctt gaa gee ttg 
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 780 

ctt gaa aag caa gee tac aaa gaa caa gtg ctg gca aaa cgt tta aca 
Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 

atg ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc age 
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 
800 805 810 815 

gaa ttt ate gee ctt ctg cca age ata cgc ccg cgc tat tac teg att 
Glu Phe He Ala Leu Leu Pro Ser He Arg Pro Arg Tyr Tyr Ser He 

820 825 830 

tct tea tea ect cgt gtc gat gaa aaa caa gca age ate acg gtc age 
Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser He Thr Val Ser 

835 840 845 

gtt gtc tea gga gaa gcg tgg age gga tat gga gaa tat aaa gga att 
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He 
650 855 860 



gcg teg aac tat ctt gee gag ctg caa gaa gga gat acg att acg tgc 
Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys 
865 870 875 

ttt att tec aca ccg cag tea gaa ttt acg ctg cca aaa gac cct gaa 
Phe He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 
880 885 890 895 

acg ccg ctt ate atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga 
Thr Pro Leu He Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 

900 90S 910 



2064 



2112 



2160 



2208 



2256 



2304 



2400 



2448 



2496 



2544 



2592 



2640 



2688 



2736 



ttt gtg cag gcg cgc aaa cag eta aaa gaa caa gga cag tea ctt 



2784 
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Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 

915 920 925 

gga gaa gca cat tta tac ttc ggc tgc cgt tea cct cat gaa gac tat 2832 
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 940 

ctg tat caa gaa gag ctt gaa aac gcc caa age gaa ggc ate att acg 2880 
Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly lie lie Thr 
945 950 955 

ctt cat acc get ttt tct cgc atg eca aat eag ecg aaa aea tae gtt 292 8 
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 
960 965 970 975 

eag cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat 2976 
Gin His Val Met Glu Gin Asp Gly Lys Lys Leu lie Glu Leu Leu Asp 

980 985 990 

caa gga gcg cac ttc tat att tgc gga gac gga age caa atg gca cct 3024 
Gin Gly Ala His Phe Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro 

995 1000 1005 

gee gtt gaa gca acg ctt atg aaa age tat get gac gtt cac caa gtg 3072 
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

agt gaa gca gac get cgc tta tgg ctg cag cag eta gaa gaa aaa ggc 3120 
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly 
1025 1030 1035 

cga tac gca aaa gac gtg tgg get ggg taa 3150 
Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 1045 

<210> 2 
<211> 1048 
<212> PRT 

<213> Bacillus megaterium 
<400> 2 

Thr He Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys Asn 
15 10 15 

Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys He 

20 25 30 

Ala Asp Glu Leu Gly Glu He Phe Lys Phe Glu Ala Pro Gly Arg Val 
35 40 45 

Thr Arg Tyr Leu Ser Ser Gin Arg Leu He Lys Glu Ala Cys Asp Glu 
50 55 60 

Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg Asp 
65 70 75 80 

Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp 

85 90 95 

Lys Lys Ala His Asn He Leu Leu Pro Ser Phe Ser Gin Gin Ala Met 

100 105 110 

Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val Gin 
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H5 120 125 

Lys Trp Glu Arg Leu Asn Ala Asp Glu His lie Glu Val Pro Glu Asp 
130 135 140 

Met Thr Arg Leu Tbr Leu Asp Thr lie Gly Leu Cys Gly Phe Asn Tyr 
145 150 155 160 

Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe lie Thr Ser 

165 170 175 

Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala Asn 

180 185 190 

Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu Asp 
195 200 205 

lie Lys Val Met Asn Asp Leu Val Asp Lys lie lie Ala Asp Arg Lys 
210 215 220 

Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 
225 230 235 240 

Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg Tyr 

245 250 255 

Gin lie He Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly Leu 

260 265 270 

Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin 
275 280 285 

Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
290 295 300 

Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305 310 315 320 

Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys 

325 330 335 

Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu 

340 345 350 

Leu Met Val Leu He Pro Gin Leu His Arg Asp Lys Thr He Trp Gly 

360 365 



Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala 
370 375 380 

He Pro Gin His Ala Phe Lys. Pro Phe Gly Asn Gly Gin Arg Ala Cys 
385 390 395 400 

He Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met 

405 410 415 

Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 

420 425 430 

He Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala 
435 440 445 

Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr Glu 
450 455 460 
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Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 
465 470 475 480 

Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 

485 490 495 

Ala Arg Asp Leu Ala Aap He Ala Met Ser Lys Gly Phe Ala Pro Gin 

500 505 510 

Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
515 520 525 

Val Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
530 535 540 

Lys Gin Phe Val Asp Trp Leu Aap Gin Ala Ser Ala Asp Glu Val Lys 
545 550 555 560 

Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Aap Lys Asn Trp Ala Thr 

565 570 575 

Thr Tyr Gin Lys Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala Lys 

580 585 590 

Gly Ala Glu Asn lie Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
595 600 605 

Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
610 615 620 

Ala Ala Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys Ser 
625 630 635 640 

Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 

645 650 655 

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu 

660 665 670 

Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu He Glu Leu 
675 680 685 

Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He Pro 
690 695 700 

Arg Asn Tyr Glu Gly He Val Asn Arg Val Thr Ala Arg Phe Gly Leu 
705 710 715 720 

Asp Ala Ser Gin Gin He Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 

725 730 735 

His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr 

740 745 750 

Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met Ala 
755 760 765 

Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 
770 775 780 

Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr Met 
785 790 795 800 

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 
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805 810 815 

Phe He Ala Leu Leu Pro Ser He Arg Pro Arg Tyr Tyr Ser He Ser 

820 825 830 

Ser Ser Pro Arg Val* Asp Glu Lys Gin Ala Ser He Thr Val Ser Val 
835 840 845 

Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He Ala 
850 855 8.60 

Ser Asn Tyr Leu Ala Glu Leu Gla Glu Gly Asp Thr He Thr Cys Phe 
865 870 875 880 

He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 

885 890 895 

Pro Leu He Mel: Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly 

900 905 910 

Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu Gly 
915 920 925 

Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
930 935 940 

Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly He He Thr Leu 
945 950 955 960 

His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val Gin 

965 970 975 

His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp Gin 

980 985 990 

Gly Ala Hia Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro Ala 
995 1000 1005 

Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val Ser 
1010 1015 1020 

Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly Arg 
1025 1030 1035 1040 

Tyr Ala Lys Asp Val Trp Ala Gly 

1045 



<210> 3 
<211> 30 
<212> DMA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PCR primer 
<400> 3 

gcaggagacg ggttgnnnac aagctggacg 

<210> 4 
<211>. 30 
<212> DNA 
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<213> Ar-tificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 4 

cgtccagctt gtnnncaacc cgtctcctgc 

<210> 5 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequences PGR primer 
<400> 5 

gaagcaatga acaagnnnca gcgagcaaat ccag 

<210> 6 
<211> 34 
<212> DHA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 6 

ctggatttgc tcgctgnnnc ttgttcattg cttc 

<210> 7 
<211> 41 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 7 

gctttgataa aaacttaaag tcaannnctt aaatttgtac g 

<210> 8 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 8 

cgtacaaatt taagnnnttg acttaagttt ttatcaaagc 

<210> 9 
<2H> 37 
<212> DNA 

<213> Artificial sequence 
<220> 

<2 23> Description of the artificial secjuence: PGR primer 
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<400> 9 

gt.ta'ttAaac acagataaan nngt-tcaagc t.^t:gat;g 

<210> 10 
<211> 37 
<212> DKA 

<213> Ar1:lficial sequence 
<220> 

<223> Description of the artificial sequences PGR primer 
<400> 10 

catcaaagct tgaacnnntt tatctgtgtt taataac 

<210> 11 
<211> 37 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR prlzner 
<400> 11 

gttattaaac acagataaac cgnnncaagc tttgatg 

• 

<210> 12 
<211> 37 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 12 

catcaaagct tgnnncggtt tatctgtgtt taataac 

13 . 
33 
DNA 

Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 13 

cgaggcgcct ggtnnngtaa cgcgctactt ate 



<210> 
<211> 
<212> 
<213> 



<210> 14 
<2H> 33 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR-prlmer 
<400> 14 

gataagtagc gcgttacnnn accaggcgcc teg 



<210> 15 
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<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Descrlp'blon of the artificial sequence: PGR primer 
<400> 15 

cctggtcgtg taacgcgcim nttatcaagt cage 

« 

<210> 16 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Description of the artificial sequence: PGR primer 
<40D> 16 

gctgacttga taannngcgc gttacacgac cagg 

<210> 17 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 17 

gctttgataa aaacttannn caagcgctta aatttgtacg 

<210> 18 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 18 

cgtacaaatt taagcgcttg nnntaagttt ttatcaaagc 

<210> 19 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PGR primer 
<400> 19 

ggcgacgaac tannngttct gattcctcag 

<210> 20 
<211> 30 
<212> DNA 

<213> Artificial sequence 



<220> 



« 
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<223> Description of the artificial sequence: PCR primer 
<400> 20 

ctgaggaatc agaacnnnta gttcgtcgcc 
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1. A modified cytochrome P4 50 monooxygenase which, in comparison 
5 with the wild-type enzyme, shows an altered substrate profile 

in the terminal and/or subterrainal enzymatic hydroxylation of 
aliphatic carboxylic acids, owing to site-specific 
mutagenesis of its substrate binding region. 

10 2. A monooxygenase as claimed in claim 1, which is derived from 
cytochrome P450 monooxygenases of bacterial origin. 

3. A monooxygenase as claimed in claim 2, which is derived from 
Bacillus megaterium cytochrome P450 monooxygenase BM-3 with 
an amino acid sequence in accordance with SEQ ID NO: 2, which 
has at least one functional mutation in one of the following 
amino acid sequence regions: 24-28, 45-51, 70-72, 73-82, 
86-88, 172-224 und 352-356, with the proviso that, if the 
enzyme carries the mutation F87A, more than one of these 
20 regions is mutated. 

4- A monooxygenase as claimed in claim 3, which comprises at 
least one functional mutation in the amino acid sequence 
regions 86-88 and 172-224. 

25 

5. A monooxygenase as claimed in claim 4, which comprises at 

least one of the following amino acid substitution patterns: 



30 



40 



6 



a) 


F87V? 






b) 


F87A 


LiaSK; 






c) 


F87V 


liiaSK; 






d> 


F87A 


L188K A74G 


• 

/ 




e) 


F87V 


L1B8K A74G 


• 




f> 


F87A 


L188K A74G 


R47F; 




g> 


F87V 


L188K A74G 


R47F; 




h) 


F87A 


L188K A74G 


R47F 


V26T; or 


i) 


F87V 


L188K A74G 


R47F 


V26T; 



and functional equivalents thereof. 

A monooxygenase as claimed in claim 3, which comprises a 
single amino acid substitution from amongst the following: 



a) V26T, 

45 b) R47F, 

c) S72G, 

d) A74G, 
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e) F87V, 

£} Xj188z, where Z is an amino acid selected from amongst K, 

R, W, Q, N, G, A and S, and 

g) M354T; 

5 - ' • 

and functional equivalents thereof. 

7« A nudleic acid sequence encoding a monooxygenase as claimed 
in any of the preceding claims and the complementary nucleic 
10 acid sequence thereof. 

8. An expression construct comprising, under the genetic control 
of regulatory nucleic acid sequences/ an encoding sequence 
which encompasses a nucleic acid sequence as claimed in claim 
15 7. 

A vector which encompasses at least one expression construct 
as claimed in claim 8. 

A recombinant microorganism which has been transformed with 
at least one vector as claimed in claim 9. 

A microorganism as claimed in claim 10, selected from amongst, 
bacteria of the genus Escherichia. 

A process for the enzymatic production of terminally or 
subterminally hydroxy lated aliphatic carboxylic acids, which 
comprises 

al ) culturing a recombinant microorganism as claimed in claim 
10 or 11 according to the invention in the presence of a 
culture mediiim which contains at least one hydroxylatable 
carboxylic acid or at least one hydroxylatable carboxylic 
acid derivative; or 

a2) incubating a reaction medium containing at least one 
hydroxylatable carboxylic acid or at least one 
hydroxylatable carboxylic acid derivative with an enzyme 
as claimed in any of claims 1 to 6, and 

b) isolating the resulting hydroxy lated product from the 
medium. 

13. A method as claimed in claim 12, wherein the hydroxylatable 
45 carboxylic acid is a C8-C30 monocarboxylic acid or a 
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14. A method as claimed in claim 13, wherein the hydroxylatable 
carboxylic acid is a Ce-Cis-monocarboxylic acid or a 
derivative thereof and the monooxygenase used is a mutant as 
claimed in claim 5. 

5 

15. A method as claimed in claim 13, wherein the hydroxylatable 
carboxylic acid is a Cx2-C3o-monocarboxylic acid or a 
derivative thereof and the monooxygenase employed is a mutant 
selected from amongst the single mutants F87A, F87V, V2ST, 

10 S72G, A74G and M354T, and the multiple mutants 

F87A I*188K A74G R47F; 
F87V L188K A74G R47F; 
F87A i:il88K A74G R47F V26T; or 
F87V ril88K A74G R47F V26T. 

15 

16. A method as claimed in any of claims 12 to 15, wherein the 
reaction is carried out in the presence of an electron donor 
or a reduction equivalent. 

20 17. A method as claimed in claim 16, wherein the electron donor 
or the reduction equivalent is selected from amongst NADH, 
NADPH and Zn/Co(XIX) sepulchrate. 
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Helix A 




Fig.2 
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